Profiling Distributed File Systems with Computer Animation

Authors

  • Andrew Leung
  • Eric Lalonde
  • Jacob Telleen
  • Crystal Lee
Abstract

Achieving performance, reliability, and scalability has proven difficult for distributed file systems. Placement of data, load distribution, and other overheads are often the culprits. Profiling is a useful technique for understanding file system behavior, improving performance, and debugging problems. Existing file system profiling methods often examine fine-grained system activity, such as the path of a single request, which makes it difficult to understand interactive relationships. Current profiling techniques also do not provide a way for users to easily view full system behavior, often relying on logs, statistics, or simple graphs. We present a distributed file system profiling method based on clear-box profiling and visualization of full system behavior. Our approach allows for portable, low-overhead profiling and provides users with real-time animation of system behavior. We present users with a visualization of the file system architecture and animate behaviors, such as load distribution, data location, and network traffic, under real workloads. This allows users to visually identify bottlenecks and latencies, characterize system state and I/O requests, and debug problems. We evaluate the overhead and effectiveness of our profiling method on Ceph, a petabyte-scale parallel file system. Our evaluation shows that the visualization client is able to profile each device with minimal overhead. Additionally, we demonstrate the usefulness of our profiling and visualization techniques through a series of profiles on Ceph. Using these profiles, we have discovered several interesting and important problems, ranging from inefficiencies, namely in small I/O operations, to major issues, such as instabilities in the Ceph messaging layer.

Similar resources

E2DR: Energy Efficient Data Replication in Data Grid

Abstract: Data grids are an important branch of grid computing which provide mechanisms for the management of large volumes of distributed data. Energy efficiency has recently emerged as a hot topic in large distributed systems. The development of computing systems is traditionally focused on performance improvements driven by the demand of client applications in scientific and business domai...

P2P Network Trust Management Survey

Peer-to-peer (P2P) applications are no longer limited to home users and are starting to be accepted in academic and corporate environments. While file sharing and instant messaging applications are the most traditional examples, they are no longer the only ones benefiting from the potential advantages of P2P networks. For example, network file storage, data transmission, distributed computing, and co...

Applicability of parallel file systems for technical simulations: a case study

The lack of balance in processor speed and input/output performance of modern computers presents a challenging problem in high performance computing. Parallel and distributed file systems are one of the solutions to this problem because of their ability to distribute the input/output load over the network and improve the performance of clusters in order to meet the demands of applications for t...

SPARC SPARC Station VAX

With the increasing availability of multiprocessor platforms based on different types of architectures (shared memory, distributed memory, network based), users will be increasingly faced with heterogeneous and distributed multiprocessor computing facilities. Programming environment concepts have to be found which enable the user to program and use these heterogeneous computing resources, consis...

A Classification of File Formats for Animation

This paper proposes a classification of existing file formats for computer animation. File formats for computer animation store and describe the evolution of virtual scenes over time. They permit applications that produce animated scenes to share and exchange information. File formats are classified by the information they store and by the means they use to store it.

Publication date: 2007